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This application is a 371 of PCT/ JP00/009348 , filed 
December 27, 2000. 

TECHNICAL FIELD 

The invention of this application relates to a probe for 
protein-protein interaction analysis, and to a method for the 
analysis of protein-protein interactions using such a probe. 
More specifically, the invention of this application relates to a 
probe for protein-protein interaction analysis and to a method 
for the analysis of protein-protein interactions using such a 
probe, which enable accurate and simple analysis of protein- 
protein interactions in all living cells. 

BACKGROUND ART 

Protein-protein interactions are known to play an important 
role with respect to structure and functions of living cells. 

Many problems currently studied in molecular biology and 
biochemistry, such as gene transcription mechanism and 
intracellular information signaling, are related to protein- 
protein interactions. 

Some of the problems in the field of molecular biology and 
biochemistry have been heretofore solved by the development of a 
two-hybrid method (Chien, C. T., Bartel, P. L., Sternglanz, R., 
Fields, S., Proc. Natl. Acad. Sci. USA 1991, 88, 9578-9582; 
Fields, S., Song, 0., Nature 1989, 340, 245-246) by which 
screening is conducted based on interaction of a "bait" protein 
and a "prey" protein in a protein library. The two-hybrid method 
has been suggested as an effective method that facilitates the 
identification of candidate molecules for protein-protein 



interaction and also creates new protein-protein interaction maps 
(Flores, A., Briand, J. F. , Gadal, 0., Andrau, J. C, Rubbi, L., 
Mullem, V., Boschiero, C, Goussot, M., Marck, C, Carles, C, 
Thuriaus, P., Sentenac, A. , Werner, M. , Proc. Natl. Acad. Sci. 
USA 1999, 96, 7815-7820; Ito, T., Tashiro, K., Muta, S., Ozawa, 
R., Chiba, T., Nishizawa, M . , Yamamoto, K., Kuhara, S., Sakaki, 
Y., Proc. Natl. Acad. Sci. USA 1999, 97, 1143-1147; Walhout, A. 
J. M. , Sordella, R., Lu, X., Hartley, J. L., Temple, G. F., 
Brasch, M. A., Thierry-Mieg, N . , Vidal, M . , Science 2000, 287, 
116-122) . However, the problem associated with the two-hybrid 
method is that it is applicable only to the analyzable protein 
interactions that occur close to the reporter gene in cell nuclei 
and, therefore, this method lacks generality. 

Reliability is yet another problem associated with the two- 
hybrid method, and a confirmation test using proteins with known 
functions has to be conducted by employing model cells or animals 
for every assay (Walhout, A. J. M. , et al., Science 1999, 287, 
116-122) . 

Accordingly, novel methods for protein-protein interaction 
analysis have been suggested, those methods including a ubiquitin 
split protein sensor (USPS) method by which N- and C-terminal 
ubiquitins through reactions with proteins and a nuclear- 
localized reporter is activated by cleavage of a transcription 
factor (Dunnwald, M., Varshavsky, A., Johnsson, N., Mol. Biol. 
Cell 1999, 10, 329-344; Johnsson, N., Varshavsky, A., Proc. Natl. 
Acad. Sci. USA 1998, 95, 5187-5192; Stagljar, I., Korostensky, 
C, Johnsson, N., Heesen, S., Proc. Natl. Acad. Sci. USA 1998, 
95, 5187-5192; and an SOS recruit system, in which a catalyst 
domain is brought close to a membrane localization domain through 
protein-protein interaction, a guanine exchange factor (GEF) or 
Ras is reconstructed, and it further complements yeast 
temperature-sensitive mutated yeast GEF (Aronheim, A., Nucleic 
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Acids Res. 1997, 25, 3373-3374; Aronheim, A., Zandi, E., 
Hennemann, H., Elledge, S. J., Karin, M. , Mol. Cell. Biol. 1997, 
17, 3094-3102; Broder, Y. C, Katz, S., Aronheim, A. , Curr. Biol. 
1998, 8, 1121-1124) . 

As a more general approach, a split enzyme technology 
method has been reported (Rossi, F. , Charlton, C. A. and Blau, H. 
M., Proc. Natl. Acad. Sci. USA 1997 , 94, 8405-8410; Remy, I., 
Michnick, S. W., Proc. Natl. Acad. Sci. USA 1999, 96, 5394-5399; 
Pelletier, J. N., Arndt, K. M. , Pluckthun, A., Michnick, S. W., 
Nature Biotech. 1999, 17, 683-690). With this method, a split 
enzyme is reconstructed through protein-protein interaction and 
its enzymatic activity is restored. The activity of the 
reconstructed enzyme can be measured by fungi or cellular 
phenotype or by a fluorescent enzyme substrate analogues. 

These methods have a comparatively high accuracy and attain 
the object of assaying interactions between intracellular 
proteins or membrane proteins, but they are applicable to only 
appropriately designed cells. Another problem is that sufficient 
accuracy and sensitivity cannot be obtained. Still another 
problem is that such methods require a variety of substrates to 
conduct the analysis and are therefore troublesome. 

Therefore, a general method or probe suitable for accurate 
and simple analysis of protein-protein interactions and 
applicable to any protein has not yet been developed. 

With the foregoing in view, it is an object of the 
invention of the present application to solve the problems of the 
prior art and to provide a probe for protein-protein interaction 
analysis, which enables highly accurate and simple analysis of 
protein-protein interactions, and to a method for the analysis of 
protein-protein interactions using such a probe. 

SUMMARY OF THE INVENTION 

In order to solve the above-mentioned problems, the 
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invention of this application firstly provides a probe for 
protein-protein interaction analysis, this probe serving to 
analyze the interaction between two proteins, wherein protein 
splicing is induced by protein-protein interaction and a 
physicochemically or biochemically detectable protein is 
regenerated. 

Secondly, the invention provides the probe for protein- 
protein interaction analysis, comprising two probes: a probe "a" 
which comprises an N-terminal polypeptide of an intein and an N- 
terminal polypeptide of a labeled protein, and a probe "b" which 
comprises a C-terminal polypeptide of the intein and a C-terminal 
polypeptide of the labeled protein. 

Thirdly, the invention provides the probe for protein- 
protein interaction analysis, wherein the C-terminal of probe "a" 
and the N-terminal of probe "b" each has a linker sequence. 

Fourthly, the present invention provides the probe for 
protein-protein interaction analysis, wherein the intein is an 
endonuclease derived from yeast VMA; and fifthly, the invention 
provides the probe for protein-protein interaction analysis, 
wherein the intein is DnaE derived from cyanobacterium. 

Sixthly, the invention provides the probe for protein- 
protein interaction analysis, wherein the indicator protein 
(e.g., labeled protein) is a fluorescent protein; and seventhly, 
the invention provides the probe for protein-protein interaction 
analysis, wherein the fluorescent protein is a green fluorescent 
protein (GFP) . 

Eighthly, the invention provides the probe for protein- 
protein interaction analysis, wherein the indicator protein is an 
emission-catalyzing enzyme; and ninthly, the invention provides 
the probe for protein-protein interaction analysis, wherein the 
emission-catalyzing enzyme is a lucif erase. 

Tenthly, the invention provides a method for the analysis 
of protein-protein interactions, comprising the steps of causing 
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a protein linked to the probe "a" of any of the above-described 
second to ninth claims to be co-present with a protein linked to 
the probe "b" of any of the above-described second to ninth 
claims, and detecting a signal of the labeled protein. 

Eleventhly, the invention provides the method of protein- 
protein interaction analysis, wherein the protein linked to the 
probe "a" is caused to be co-present with the protein linked to 
the probe "b" by introducing a polynucleotide that expresses the 
probe for protein-protein interaction analysis of any of the 
above-described first to ninth claims into an eukaryotic cell. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a schematic drawing illustrating an example of 
the structure and operation principle of the probe for protein- 
protein interaction analysis of the present invention. Here, (I) 
illustrates co-presence of protein pairs and probes; (II) 
illustrates protein-protein interaction; (III) illustrates 
protein splicing; and (IV) illustrates indicator protein linkage. 
Furthermore, (la) denotes the probe "a" for protein-protein 
interaction analysis; (lb) denotes the probe "b" for protein- 
protein interaction analysis; (2a) denotes the N-terminal 
polypeptide of an intein; (2b) denotes the C-terminal polypeptide 
of the intein; and (3) denotes a labeled protein. In particular, 
(3a) denotes the N-terminal polypeptide of the labeled protein; 
and (3b) denotes the C-terminal polypeptide of the labeled 
protein. (4a) denotes protein (or a protein site) A; and (4b) 
denotes protein (or protein site) B. 

Fig. 2 shows amino acid residues substituted between 1124 
and 1129 with respect to the structure of the N-terminal 
polypeptide of EGFP in the Example of the present invention. 

Fig. 3 shows the results of SDS-PAGE analysis of proteins 
expressed in E. coli and transformed with pGEX-NVC. Here, lanes 
(a) to (c) are Coomassie Blue-stained SDS-PAGE; lane (a) 
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indicates the protein molecular mass standard (Novagen) with the 
molecular mass thereof (kDa); lane (b) indicates the crude 
product before a GST-affinity column; and lane (c) indicates the 
sample purified by the GST-affinity column. Lanes (d) and (e) 
show Western blotting analysis of a crude lysate; (d) shows the 
analysis results obtained with the antibodies specific to VDE and 
(e) shows the results of analysis obtained with the antibodies 
specific to GFP (e) . 

Fig. 4 is a schematic representation of the structure of a 
plasmid (probe <1> for protein-protein interaction analysis) 
constructed in the Example of the present invention. Here, the 
restriction endonuclease splicing sites are shown above the 
dotted lines indicate the endonuclease domain in the VDE intein, 
the open bars indicate the VDE intein with no endonuclease, the 
black bars indicate GST labeling, and the hatching indicates His 
labeling. Furthermore, the lattice patterns indicate a linker, 
"stop" indicates a translation termination codon, and "start" 
indicates a translation initiation codon. 

Fig. 5 shows the splicing results obtained with interaction 
of protein CaM and protein M13 in the Example of the present 
invention. Here, (a) indicates pET_NVCASD (C/M) (SDS-PAGE) ; (b) 
indicates pET_NVCASD (M/C) ; (c) indicates pET_NVCASD (/) gene 1; 
(d) indicates pET_NVCASD linker (C/M) ; (e) indicated pET_NVCASD 
linker (M/C); and (f) indicates pET_NVCASD linker (/) . (A) shows 
the results of Western blotting obtained with anti-His labeling, 
and (B) shows the results of Western blotting obtained with anti- 
GFP labeling. 

Fig. 6 shows the fluorescence spectra of crude products 
from E. coli having plasmids (probe <1> for protein-protein 
interaction analysis) constructed in the Example of the present 
invention (excitation: 470 nm, band width: 5.0 nm; and the 
emission band width: 5.0 nm) . (a) indicates NVCASD linker (/) ; 
(b) indicates NVCASD (C/M); and (c) indicates NVCASD linker 
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(C/M) . 

Fig. 7 shows the structure of a plasmid used in the Example 
of the present invention. Here, the dotted lines indicate an 
intracellular ribosome entry site (IRES); the cDNAs of pLucAll, 
pLucN and pLucC are inserted into pcDNA3.1 (+) ; "Stop" and 
"Start" indicate translation termination and initiation codons, 
respectively. 

Fig. 8 shows the emission intensity of LucN alone, LucC 
alone and LucAll in the Example of this invention. 

Fig. 9 shows the enhancement of protein splicing by insulin 
in the Example of this invention. 

Fig. 10 shows the influence of amino acid mutation in the 
Example of this invention. 

Fig. 11 shows the insulin concentration dependency of Lum__F 
emission intensity in the Example of this invention. 

DETAILED DESCRIPTION OF THE INVENTION 

The probe for protein-protein interaction analysis of the 
present invention is based on the principle that the interaction 
of two proteins, each being linked to a probe, induces splicing, 
the sites of split labeled proteins contained in the probes are 
linked and regenerate the labeled proteins, and a signal is 
emitted. 

Protein splicing is a process wherein the internal protein 
segment (intein) is excised from a translated protein. In this 
process, the excision of intein is accompanied by the ligation of 
flanking sequences (exteins) (Gimble, F. S., Sci. Biol. 1998, 5, 
R251-256) . 

With the probe for protein-protein interaction analysis of 
the present invention, protein-protein interaction analysis is 
conducted by ingeniously applying such self -excision of inteins. 

In other words, the probe for protein-protein interaction 
analysis of the present invention consists of two probes "a" and 
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"b" , each probe being connected to a respective protein. Fig. 1 
is a schematic representation of the principle of the probe for 
protein-protein interaction analysis of the present invention. 

Of the two probes "a" (la) and "b" (lb), probe "a" (la) 
comprises the N-terminal polypeptide of an intein (2a) and the N- 
terminal polypeptide of a indicator protein (3a), and probe "b lf 
(lb) comprises the C-terminal polypeptide of the intein (2b) and 
the C-terminal polypeptide of the indicator protein (3b). 

The two probes "a" (la) and "b" (lb) are linked to the 
respective proteins A (4a) and B (4b) , , to analyze the protein A 
- protein B interaction. Hence, when protein A (4a) and protein 
B (4b) linked to probe "a" (la) and probe "b" (lb) are co- 
present (I) and when the two proteins (4a, 4b) interact with each 
other (II), the intein (2) is excised through splicing (III). As 
a result, the indicator protein sites (3a, 3b) bonding to the 
intein (2) are ligated (IV) . Therefore, if the obtained 
indicator protein (3) is analyzed, the protein-protein 
interaction can be confirmed. 

In this process, if protein A (4a) does not interact with 
protein B (4b), excision of intein (III) and splicing of exteins 
do not occur; therefore, the indicator protein (3) is not 
regenerated (IV) and cannot be detected. 

For the probe for protein-protein interaction analysis of 
the present invention, probes "a" and "b" (la and lb) may each be 
composed only of the intein polypeptide and the indicator protein 
polypeptide, but may also additionally contain a linker sequence 
or the like. Accordingly, in probes "a" and "b" (la and lb), 
the intein polypeptides (2a, 2b) and the indicator protein 
polypeptides (3a, 3b) may be directly bound to each other, or may 
be bound via a polypeptide such as a linker sequence. 

Well-known inteins derived from various organisms can be 
employed as the intein. Suitable examples include those derived 
from eukaryotes such as Saccharomyces cerevisiae (yeast) See VMA, 
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Candida tropiallis {Candida fungi) Ctr VMA; those derived from 
eubacteria such as Mycobacterium tuberculosis (tubercle bacillus 
Mtu recA; and those derived from paleobacteria such as 
Thermoplasma asidophilum Tac VMA. In addition, Cyanobacterium 
synechocystis sp. (cyanobacterium) DnaE may also serve as the 
intein. 

When protein A (4a) interacts with protein B (4b) in the 
probes (la, lb) for protein-protein interaction analysis of the 
present invention, the inteins are preferably site-specific 
endonucleases to provide for autocatalytic intein excision. 

Specifically, intein derived from yeast VMA and Ssp DnaE 
intein derived from cyanobacterium are the preferred examples. 
In yeast VMA, the nascent translation product, 120-kDa VMA1 , 
catalyzes protein splicing to give a 70-kDa H + -ATPase sub-unit 
and a 50-kDa site-specific endonuclease (VDE, or Pl-Scel) . This 
VDE is preferably used as the intein sites (2a, 2b) in the probe 
(la, lb) for protein-protein interaction analysis. For the Ssp 
DnaE derived from cyanobacterium, the DNA sequence of the strain 
PCC6803 has been clarified (the N-terminal has 123 amino acid 
residues and the C-terminal has 36 amino acid residues) ; 
moreover, it is a natural split intein, and ligation of N- and C 
extein is known to occur therein (Wu, H., Hu, Z., Liu, X., XU., 
M., Q. , Proc. Natl. Acad. Sci. USA 1998, 95, 9226-9231). 
Therefore, Ssp DnaE intein is easy to handle, and preferred as 
the intein sites (2a, 2b) in the probes (la, lb) for protein- 
protein interaction analysis. 

Of the various inteins mentioned above, the cyanobacterium 
derived Ssp DnaE intein is preferred, because the protein-protei 
interactions in mammal cells can also be detected with high 
sensitivity. Needless to say, other known or novel inteins may 
also be employed. 

For effective excisions of intein and splicing (III) (la, 
lb) of the present invention, probes la and lb for protein- 
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protein interaction analysis have to be correctly folded and 
sites have to be correctly aligned, so that the two sites 
participating in protein splicing be adjacent to each other 
(Duan, X., Gimble, F. S. and Quiocho, F. A. , Cell 1991, 89, 555- 
564) . Accordingly, the inteins derived from organisms may be 
directly used or they may be designed so as to facilitate the 
splicing, for example, by transforming or deleting some of the 
amino acid residues, or by introducing a suitable linker 
sequence . 

For example, for the above-mentioned VDE, a mutant obtained 
by deleting the endonuclease domain and substituting with a 
flexible dodecapeptide linker is known to demonstrate a high 
splicing activity (Cooper, A. A., Chen, Y. J., Lindorfer, M. A., 
Stevens, T. H., EMBO J. 1993, 12, 2575-2583; Chong, S., Xu, M. - 
Q. , J. Biol. Chem. 1997, 272, 15587-15590). On the other hand, 
the cyanobacterium-derived Ssp DnaE is a naturally split intein, 
as mentioned above, and is known to fold correctly and splice the 
exteins. The above-mentioned linker sequence may or may not be 
introduced into dnaE 

The above-described intein is split into the N-terminal 
polypeptide (2a) and the C-terminal polypeptide (2b) . In the 
probes (la, lb) for protein-protein interaction analysis of the 
present invention, the split polypeptides (2a, 2b) are bound to 
the N-terminal polypeptide (3a) of a indicator protein and the C- 
terminal polypeptide (3b), respectively, to become a probe. 

On the other hand, in the probes (la, lb) for protein- 
protein interaction analysis of the present invention, the 
indicator protein sites are directly peptide bonded and linked 
(IV) when the intein is spliced out and probes "a" and "b" (la, 
lb) (III) are spliced by the interaction ( II ) of proteins A and B. 
The indicator protein (3) may be any protein that can be analyzed 
again by the linkage (IV) . For example, fluorescent protein and 
emission-catalyzing enzyme are preferably employed. Fluorescent 
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proteins such as green fluorescent protein (GFP) are preferred 
because they emit light after protein linkage and can be analyzed 
visually. Emission-catalyzing enzymes such as luciferase are also 
preferred because they form active centers after linkage and emit 
light that can be easily detected with a luminometer. For the 
split N- and C-terminals of luciferase to give individually no 
fluorescence and to restore the activity again after bonding, the 
luciferase must be split so that its active center is divided 
into two. The luciferase enzyme is known to be folded into two 
domains that sandwich a broad comprising an active center, one 
being a large N-terminal domain comprising one p-barrel and two 
(3-sheets and the other is a C-terminal site (Waud, J. P., Sala- 
Newby, G. B., Matthews, S. B., Campbell, A. K., Biochim. Biophys. 
Acta 1996, 1292, 89-98; Conti. E. Franks, N . P., Brick, P., 
Structure 1996, 4, 287-298). Accordingly, it is preferred that 
the enzyme luciferase be split into 3a and 3b[[,]] at the 
flexible site where the two domains are linked. With respect to 
the luciferase enzyme, it is also known that the N-terminal 
polypeptide (3a) has a cysteine (Cys) residue in the bonding site 
with the N-terminal polypeptide (2a) of the intein, has a tyrosine 
(Tyr) residue upstream of -1 position thereof, and has alanine 
(Ala) and phenylalanine (Phe) in the -3 and -4 positions, 
respectively, thereby enabling even more efficient splicing. 
Therefore, effective splicing may be induced by producing a 
mutant (R437C) in which the 437th arginine residue is converted 
into cysteine, a mutant (D436Y) in which the 436th asparagic acid 
residue is converted into tyrosine, and a mutant (I434A) in which 
the 434th isoleucine residue is converted into alanine. 

As mentioned hereinabove, the invention of this application 
provides the probe for protein-protein interaction analysis. 
Furthermore, one probe (e.g., probe "a") is linked to one 
protein (protein A) whose interaction is to be confirmed, the 
other probe (probe "b" ) is linked to the other protein (protein 
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B) whose interaction is to be confirmed, and the two probes are 
made to coexist in a close proximity, thereby enabling the 
analysis of interaction between proteins A and B based on the 
above-mentioned principle and mechanism. 

The proteins (4a, 4b) may be linked with the probes (la, 
lb) by any method, as long as the proteins and the probes are not 
affected. For example, any ordinary chemical, biochemical or 
genetic engineering techniques can be employed. 

Any method may be employed for detecting and analyzing 
protein-protein interactions (light emission) by using the probe 
for protein-protein interaction analysis of the invention. 
Experimental methods and detectors generally used in the field of 
chemistry and biochemistry may be employed. For example, 
detection and analys is can be easily conducted by using a 
luminometer . 

Embodiments of the invention are described in more detail 
in the following Examples with reference to the drawings 
attached. It goes without saying that the invention is not 
limited to these Examples, and various changes and modifications 
may be made. 

EXAMPLES 

Example 1: Construction of Probe <1> for protein-protein 
interaction analysis 

A yeast VMAl-derived intein (VDE) was used as the intein 
site of probe <1> for protein-protein interaction analysis. 

Of the 454 amino acids constituting VDE, Cysl is taken as 
the first amino acid residue and Asn454 is taken as the last 
amino acid residue. The C-extein starts from Cys455, and the 
extein residue adjacent to Cysl is numbered -1. The numbers then 
increase as -2, -3, . . . toward the N_Extein. 

For the labeled site, a green fluorescent protein derived 
from light-emitting jelly fish (Aequorea victoria) (EGFP, 
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described, for example, in Current Biology 1996, 6 (2), 178-182) 
was used. 

In probe <1> for protein-protein interaction, one Cys 
residue, a Gly residue upstream of the -1 position, and three 
hydrophobic amino acid residues at -5, -4 and -3 positions have 
to be present in the N-terminal polypeptide of EGFP as the 
joining sections for splicing. 

Accordingly, the amino acid residues between 1124 and 1129 
where the N-terminal polypeptide structure of EGFP is relatively 
stable were substituted as follows (Fig. 2) . 

(1) Mutation of I129C and E125I of EGFP was conducted. The 
mutants showed fluorescence, and excitation and emission peaks 
were observed at 488 nm and 510 nm identical to those of EGFP. 

(2) Fluorescence disappeared with L126Y mutation at ml25 , 
although the expression level of the mutant was the same. 

The results indicated that the ml29EGFP mutant was folded 
incorrectly and/or splicing sites of the fluorescent protein 
sites could not be linked due to L126Y mutation. 

Accordingly, a ml25EGFP mutant for which I129C and E125I 
mutation was performed was used as the indicator protein site of 
the probe for protein-protein interaction. This indicator protein 
site is hereinafter referred to as an EGFP mutant. 

Example 2: Confirmation of splicing in single polypeptide 

To confirm whether protein splicing occurred in a single 
polypeptide in which VDE was sandwiched between the N-terminal 
polypeptide and C-terminal polypeptide of the EGFP mutant, pGEX- 
NVC was expressed at 25°C in E. coli. 

(1) Cells of E. coli DH5a were used to express a glutathion 
S-transf erase fused protein (GST) . A plasmid that covers the VDE 
region and the N- and C-terminal polypeptides of the EGFP mutant 
was fused to the GST gene under control or a tac promoter. This 
produced a chimera protein comprising GST (26 kDa) , 125 residues 
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(13 kDa) from the N-terminal polypeptide of the EGFP mutant, VDE 
(50 kDa) and the C-terminal polypeptide (14 kDa) of the mutant . 

(2) The protein obtained was extracted from the E. coli 
cells, purified, and identified by SDS-PAGE. The indicator 
protein and 10 to 225-kDa marker (Novagen) were added and 
electrophoresis was conducted on a 12 to 15% SDS-PAGE gel. The 
gel was visualized by Coomassie Brilliant Blue staining. 

Western blotting was conducted by using anti-VDE polyclonal 
antibody, anti-His-labeled polyclonal antibody (Santa Crus 
Biotechnology) , or anti-GFP monoclonal antibody (BioRad) as the 
probe . 

All the enzymes necessary for cloning were obtained from 
Takara Biomedical and used according to the manufacturer's 
instructions manual. 

The PCR fragments were sequenced using a genetic analyzer, 
ABI310. 

The main component of the crude product was a protein of 
about 50-kDa (Fig. 3b) . It matched the size of the extein linked 
to VDE (50-kDa) and GST. In other words, this component was 
assumed to be a fused protein of GST and the N-terminal 
polypeptide of the EGFP mutant (26-kDa + 13-kDa) . 

From this result, it was found that the 103-kDa precursor 
of the fused protein was split into the 50-kDa VDE and the 53-kDa 
GST-EGFP mutant fused protein. 

In addition, the molecular weights of VDE and GST-EGFP 
mutant fused protein were evaluated by Western blotting. 

The anti-VDE and anti-GFP antibodies were specifically 
reacted with the excised 50-kDa intein (Fig. 3d) and the 53-kDa 
GST-EGFP mutant fused protein (Fig. 3e) , respectively. The 
components observed in the vicinity of 100-kDa of the unspliced 
precursor were analyzed using these antibodies. 

The GST-EGFP mutant fused protein was further identified 
with a GST-affinity column. When the crude product was passed 
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through an affinity column[[,]] and the proteins bound to the 
resin were taken out using PreScission protease and then 
subjected to SDS-PAGE (Fig. 3c), a band was observed at 25-kDa. 
This result almost matches the molecular weight of the EGFP 
mutant . 

(3) The fluorescent spectrum of the protein purified with 
the affinity column was then measured. The maximum wavelength of 
excitation and emission was 488 nm and 510 nm, respectively, 
those results matching the values for EGFP. 

The above results indicated that the VDE located in the 
center of the single polypeptide was excised by splicing, the isl- 
and C-terminal polypeptides of the EGFP mutant were linked by 
peptide bonding, and the obtained EGFP mutant folded correctly to 
form a fluorophore. 

Example 3: Effectiveness of probe <1> for protein-protein 
interaction analysis 

(1) A recombinant fused protein labeled with His at its N- 
terminal was obtained by using cells of E. coli BL21(DE3). 

In order to test protein-protein interaction, GST of 
pGEX_NVC was substituted with the His-label of a pET16-b vector, 
and the splicing functional site was split to obtain 
pET_NVCASD(/) . 

The splitting was realized by substituting the functionally 
unnecessary endonuclease motif that exists in the 185th to 389th 
amino acid region of the mutant with a cassette consisting of 
((translation termination codon) - (Shine-Dalgarno sequence) - 
(translation initiation codon) ) . 

The resulting plasmid pET_NVCASD (/) was a two-gene operon 
essentially composed of gene 1 that encodes the N-terminal 
polypeptide of EGFP and VDE (N_EGFP-VDE) and gene 2 that encodes 
the C-terminal polypeptide of VDE and EGFP (C_VDE-EGFP) . 

In order to bring the N- and C-terminal polypeptides of VDE 
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spatially close to each other during protein-protein interaction, 
a bendable peptide linker having repetitive Gly-Asn sequences was 
introduced between the translation termination codon and the 
translation initiation codon (pet_NVCASD linker (/) ) . 

In order to confirm the presence of specific protein- 
protein interaction that facilitates the splicing phenomenon 
providing EGFP in E. coli, calmodulin (CaM) and its target 
peptide M13 were selected as model proteins. 

CaM and M13 are preferred, because the structures thereof 
have been clarified by NMR (Ikura, M. , et al., Science 1992, 256, 
632-638) and, therefore, the distance between any amino acid in 
CaM and any amino acid in M13 is known. 

(2) The recombinant plasmids pET_NVCASD (C/M) , pET_NVCASD 
linker (C/M) and pET_NVCASD linker (/) were introduced into the 
cells of E. coli to obtain the corresponding fused protein. 

In order to induce the intracellular splicing, the protein 
expression was effected for 12 hours at 25°C and the cells were 
thereafter stored for 1 or 2 days at 4°C. 

Gene 1 protein expression was confirmed by an anti-His 
labeled antibody (Fig. 5A) . The main components analyzed by the 
antibody were proteins of 55-kDa (Fig. 5A a, d) , and their size 
was about the same as that of the unspliced precursor of N_EGFP- 
VDE(36-kDa) with CAM (17-kDa) or its linker (1-kDa) . 

When a lysate of E. coli having a pET_NVCASD linker (/) 
plasmid was subjected to SDS-PAGE as a control, a single band of 
N_EGFP-VDE (Fig. 5A, f) was obtained. 

The expression level of the unspliced precursor proteins 
obtained from these three plasmids was almost the same. 

On the other hand, the gene 2 protein product was 
identified by an anti-GFP monoclonal antibody, and was found to 
be the C-terminal polypeptide of EGFP. 

In addition, in the cell that expressed pETJWCASD (C/M) and 
pET_NVCASD linker (C/M) , the size of the expression protein 
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obtained from the gene operon well agreed with that of the 
precursor protein of N_VDE-EGFP (20-kDa) with M13 (3-kDa) (Fig. 
5B, a) or the linker-containing M13 (4-kDa) (Fig. 5B, d) . 
Similarly, the control plasmid expressed the expected protein 
(Fig. 5B, f) . The expression levels of all three proteins 
obtained from operon II were the same. 

Fig. 6 shows the flourescence spectra of lysates of E. coli 
cells, each carrying one of the above-mentioned plasmids. For the 
cells of E. coli comprising pET_NVCASD linker (/), no spectral 
change was observed. From the cells of E. coli having 
pET_NVCASD(C/M) and having co-expressed CaM and M13, a certain 
change in fluorescence emission at 510 nm was observed. On the 
other hand, in the co-expression of CaM and M13 where a bendable 
peptide linker was bonded to N_EGFP-VDE and C-VDE- 
EGFP, fluorescence at 510 nm changed significantly. The 
fluorescence intensity of the crude product from the cells of E . 
coli that had pET_NVCASD linker (C/M) was sufficient to 
differentiate those cells from the cells that carry the control 
plasmid pET_NVCASD linker (/) or the plasmid not encoding the 
foldable bendable linker pET_NVCASD linker (C/M) . 

These results indicated that the CaM-M13 interaction 
resulted in trans splicing and ligation of N- and C-terminal 
polypeptides of the EGFP mutant that formed the EGFP fluorophore. 

Example 4: Effect of linker in probe <1> for protein-protein 
interaction analysis 

To induce splicing, it is necessary that the N- and C- 
terminals of VDE be correctly folded. Such correct folding is 
attained only when the C-terminal N_VDE is close to the N- 
terminal C-VDE. 

Regarding CaM and M13, the distance between the N-terminal 
of CaM and the C-terminal of M13 is 50 A (from Brookhaven 
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Protein Data Bank) . This distance can be too large for N_VDE to 
be close to C-VDE and, therefore, was thought to inhibit the 
correct folding in E. coli having the plasmid pET_NVCASD linker 
(C/M) . 

However, actually, a clear increase in the fluorescence 
intensity at 510 nm was observed in plasmid pET_NVCASD linker 
(C/M) , the 10- and 9-amino acid linkers that were linked to the 
N_ and C_VDEs, respectively, provided bendability necessary for 
the VDE folding, and the conformation for effective VDE splicing 
was attained. 

Example 5: Construction of probe <2> for protein-protein 
interaction analysis 

A plasmid was constructed as described below by using E. 
coli strain DH5a as a host. 

DnaE derived from cyanobacterium Synechocystis sp. PCC6803 
was used as the intein site for a probe <2> for protein-protein 
interaction analysis. As the indicator protein for the probe, a 
wild firefly-derived luciferase (Lum-F) (pLucAll) was split 
between the 437th amino acid and the 438th amino acid and used as 
N-terminal (pLucN) and C-terminal (pLucC) . 

Fig. 7 shows the structure of the luciferase and luciferase 
segments . 

(1) Splitting of luciferase 

First, the absence of enzymatic activity in pLucN and pLucC 
was confirmed by transitionally expressing the two luciferase 
segments in human insulin receptor-overexpressing Chinese hamster 
ovarian cells (CHO-HIR) . 

Thus, the CHO-HIR cells were transfected with 1 pg of each 
plasmid (pLucAll, pLucN, and pLucC) together with 0.01 pg of a 
control plasmid (pRL-TK) , and incubated in a 12-well plate for 45 
hours. After the incubation, the emission was measured with a 
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luminometer . 

As a control, the CHO-HIR cells containing no plasmid were 
incubated under the same conditions. 

In order to correct the transfection efficiency error in 
each well of the plate, a dual-lucif erase assay using a Renilla- 
derived luciferase (Lum-R) was conducted and the transfection 
efficiency in each well was corrected. 

Fig. 8 shows the relative emission intensity (RLU) of each 
luciferase (or each luciferase segment) . 

The RLU of the CHO-HIR cells where the wild firefly 
luciferase (LucAll) was expressed was 11.4 (Fig. 8a) but the RLU 
of the CHO-HIR cells where only LucN or LucC was expressed was 
2.2 x 1CT 4 and 4.5 x 1CT 5 , respectively (Fig. 8b). Furthermore, 
the RLU (background) of the CHO-HIR cells transfected with Lum-R 
itself was 2.2 x L 10" 4 indicated that LucN or LucC alone showed 
no emission activity. 

(2) Splitting of DnaE 

As mentioned above, Synechocystis sp. PCC6803-derived DnaE 
was split into N-terminal segment of 123 amino acid residues and 
C-terminal 36 amino acid residues. 

(3) Probe <2> for protein-protein interaction 

The above-described N-terminal of DnaE in (2) was ligated 
with LucN in (1), the C-terminal of DnaE in (2) was ligated with 
LucC in (1), and then they were inserted into a multicloning site 
(MCS) of pIRES ( Invitrogen) , which is a bicistronic expression 
vector, to obtain pIRES-DSL. The resulting plasmid (pIRES-DSL) 
had two MCSs: at the 3 '-terminal of the N-terminal DnaE and at 
the 5 1 -terminal of the C-terminal DnaE (MCS-A and MCS-B, 
respectively) . Proteins or protein domains which interact with 
each other or which are to be tested for their interaction can be 
introduced into those MCSs. 

Fig. 7 shows the structure of pIRES-DSL. 
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Example 6: Effectiveness 1 of probe <2> for protein-protein 
interaction analysis 

(1) In order to confirm that probe <2> for protein-protein 
interaction analysis constructed in Example 5 acts effectively 
and induces splicing of luciferase, an oligopeptide (Y941) that 
comprises the 941st tyrosine residue of IRS-1 (which is known to 
participate physiologically in insulin signal transduction) and 
SH2N domain derived from phosphatidylinositol 3-kinase, which is 
a target protein thereof, were used (White, M. F. , Diabetologia 
1991, 40, S2-S17) . 

A plasmid pIRES-DSL (Y/S) with Y941 inserted into MCS-A and 
SH2N inserted into MCS-B and a pRL-TK vector were transiently 
coexpressed in CHO-HIR cells. 

Fig. 7 shows the structure of the pIRES-DSL (Y/S) . 

First, CHO-HIR cells were transfected with 2 pg of the 
plasmid (pIRES-DSL (Y/S)) and 0.02 ng of the control plasmid pRL- 
TK in a 6-well plate. The cells were incubated for 45 hours, and 
then the culture was substituted with an FBS-free solution of 1.0 
x 10" 7 M human insulin and the CHO-HIR cells were stimulated 
through incubation (at 37°C) for 72 hours, 3 hours or 5 minutes. 
The insulin solution of the cells incubated for 5 minutes was 
then incubated for 175 minutes in an FCS-free culture containing 
no insulin. 

Fig. 9 shows the emission intensity, which is represented 
by RLU, after the insulin stimulation. 

The corrected luciferase activity of the CHO-HIR cells 
stimulated with insulin for 72 hours, 3 hours and 5 minutes was 
1.0, 0.73, and 0.18, respectively. The emission intensity 
(background) of the cells incubated in the insulin-free culture 
was 0.15, and was found to be almost the same as that of the 
cells stimulated with insulin for 5 minutes. 

Furthermore, the emission intensity, which was represented 
by RLU, of the cells stimulated with insulin for 3 hours or 
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longer was found to be more than 4 times that of the background 
value. The kinase activity of the cells observed herein resulted 
from the specific interaction between SH2N and the peptide Y941 
(the 941st amino acid of which is tyrosine) phosphorylated with 
an insulin receptor. This in-trans folding and splicing of DnaE 
occurred effectively in probe <2> , for protein-protein 
interaction, LucN and LucC were linked, and luciferase emission 
was reproduced. 

An Y941 mutant, in which the tyrosine residue in Y941 was 
substituted with an alanine residue that was not phosphorylated 
with the insulin receptor, was then constructed and tested in the 
same manner as described above. 

The results are shown in Fig. 10. 

In the CHO-HIR cells where the Y941 mutant was expressed, 
the corrected luciferase emission was on the same level as that 
of the background. This confirmed that the protein interaction 
with SH2N was due to phosphorylation of tyrosine in Y941. 

Example 7: Effectiveness 2 of probe <2> for protein-protein 
interaction analysis 

Quantitative analysis of insulin-induced protein-protein 
interaction in CHO-HIR cells was conducted by using probe <2> for 
protein-protein interaction analysis constructed in Example 5. 

First the insulin concentration dependence of the RLU value 
was confirmed. After the CHO-HIR cells were coexpressed by 
pIRES-DSL (Y/S) and pRL-TK in the same manner as described above, 
the cells were stimulated at 37°C for 3 hours with insulin of 
various concentrations between 1.0 x 10" 13 and 1.0 x 10" 7 M by the 
above-described method. 

Fig. 11 shows the RLU. 

The emission intensity of the cells was confirmed to 
increase with insulin concentration. 
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INDUSTRIAL APPLICABILITY 

As described in detail hereinabove, the present invention 
provides a probe for protein-protein interaction analysis that 
enables accurate and simple analysis of protein-protein 
interactions. The invention also provides a method for the 
analysis of protein-protein interactions using the probe for 
protein-protein interaction analysis. 

The present method for analysis of protein-protein 
interactions does not require any reporter gene or substrate as 
in the conventional methods, and enables simple and accurate 
analysis. Furthermore, protein-protein interactions can be also 
analyzed with high-sensitivity in mammal cells. 

Accordingly, the probe for protein-protein interaction 
analysis in accordance with the present invention enables fast 
and simple analysis of the mechanisms of protein-protein 
interactions in various organisms, such as protein-protein 
interactions in cell membranes and receptor activation by 
hormones inside the cells. 
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